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4 <110> APPLICANT: Kosan Biosciences, Inc. 

5 Julien, Bryan 

7 <120> TITLE OF INVENTION: TRANSFORMATION SYSTEM BASED ON THE 

8 INTEGRASE GENE AND ATTACHMENT SITE FOR MYXOCOCCUS XANTHUS 

9 BACTERIOPHAGE MX 9 

11 <130> FILE REFERENCE: 300622009940 

13 <140> CURRENT APPLICATION NUMBER: US 10/645,818 

14 <141> CURRENT FILING DATE: 2003-08-20 

16 <150> PRIOR APPLICATION NUMBER: US 60/405,196 

17 <151> PRIOR FILING DATE: 2002-08-21 
19 <160> NUMBER OF SEQ ID NOS : 20 

21 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 1647 

25 <212> TYPE: DNA 

2 6 <213> ORGANISM: Bacteriophage MX 9 

2 8 <4 00> SEQUENCE: 1 

29 gtggcgctca ggggtgcgtc ggacgccact accaacccct ctcgacttgt gcagtccgtc 60 

3 0 gccgccggcc cgcgtgcgac tccgtggggt gtcagtgcgt cgtggtacct gctagggcgt 12 0 

31 acagcaacgg gggagtacat cgtgagtagc gacgcggcga agaagggcca tccaatggca 180 

32 actgcggcgg agcggttgcc gacgtcacca atcgacgtca acgctctggc gctggaggtg 240 

33 gcccggcttg tggccctcca gcagcaaagt gcgacgccgc catcgtccgg ccgcactttc 300 

34 ggcgcggtgg cggatgactg gctcatcact gaggccaagc gcctcgtgtg ccccgacaat 3 60 

35 gagcgccgcc atcttcgcca tatggaggcg ctctggggca tgacggatgt ggagctcacg 420 
3 6 ccgcgcgtcg tgaaggcgca cctggcggga cttctcaagc cagaggggcc gctgagcgca 480 

37 gccaccgtca ataaggtgcg ctctaccggc aagcgcatca tcaaggcggc gcaaatcaac 540 

38 ggcgagtggg gcccggtgaa tcctttcggc gtgctcgacc gcgaaaaaga ggcgaaggcc 600 
3 9 gagcgcctca cgctgacggc agcggagtgc cgggcggtgc tcccgcactt ccgcgcggac 660 

40 cggcgccgcg agtttctctt ccaggtcttt ctggggccac gccccggcga agagaaggcg 720 

41 ctcctcaagg aagatgtgga cgtcgaggcg cgcaccgtca ttttccggcg cagcaatgga 780 

42 cgagacacga caaagacggg acgcgagcgt cgcgtgccgg tgccggatga gttgtggccc 84 0 

43 gtgctcctcg atgcgatgca ggccagtccg tctgacctcg ttttcccgaa cgcgaagggt 900 

44 gagaggcagc gcgcagacac gaagatgacg cgcgtgctgc gcactgcgct atccgcggct 960 

45 ggtgtcgtgg tgggctggga ttacatctgc cgcacgcagg gctgcggcta ccgagatgtg 102 0 

46 cagtctggtg gcgcgcgcca ggagcgtcgg tgccccgcct gcgacaagcg catgtgggcc 1080 

47 agtggtcgcc ccaaacccgc cgtctggtac gggctccgtc acaccgcggc gacactgcac 1140 

48 aggaaggcgg gctgcgaccc gctcgtcatc aagctcgtgc tggggcatgc ggctgtcgac 1200 

49 accacggacg acgtgtacac gcacctcgac gaggactact gccgcgccga acttaacaag 1260 

50 ttgtcgctga aggccccgcc gccaccacct actcaccagg gaggaagtga cggcggccct 1320 

51 gactcaggac gcaacaccta cggtgaagga ggcaccatgc acggattggg agatttgcag 1380 

52 catcaccggg cgagagcttg ggaagctcgt gctctaccaa ctgagctacc accgcggaac 1440 

53 ttggccgggg gtataccggc gccgctgctg agcgtcaagg acgttgcggc ttcactctca 1500 

54 gtgagcacgg cgaaggtgta ccagctcctc gccgccggcg tcctgcctac cgtgtgggtg 1560 
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55 ggccagtcgc gccgcgtcaa gcgtgaggac ctggacgcct acatcgcccg cgcgacggcc 1620 
5 6 accggcggga agcggggtgg caaatga 164 7 

58 <210> SEQ ID NO: 2 

59 <2il> LENGTH: 548 

60 <212> TYPE: PRT 

61 <213> ORGANISM: Bacteriophage MX 9 

63 <400> SEQUENCE: 2 

64 Val Ala Leu Arg Gly Ala Ser Asp Ala Thr Thr Asn Pro Ser Arg Leu 

65 1 5 10 15 

66 Val Gin Ser Val Ala Ala Gly Pro Arg Ala Thr Pro Trp Gly Val Ser 

67 20 25 30 

68 Ala Ser Trp Tyr Leu Leu Gly Arg Thr Ala Thr Gly Glu Tyr lie Val 

69 35 40 45 

70 Ser Ser Asp Ala Ala Lys Lys Gly His Pro Met Ala Thr Ala Ala Glu 

71 50 55 60 

72 Arg Leu Pro Thr Ser Pro lie Asp Val Asn Ala Leu Ala Leu Glu Val 

73 65 70 . 75 80 

74 Ala Arg Leu Val Ala Leu Gin Gin Gin Ser Ala Thr Pro Pro Ser Ser 

75 85 90 95 

76 Gly Arg Thr ^ Phe Gly Ala Val Ala Asp Asp Trp Leu lie Thr Glu Ala . 

77 ' 100 ' 105 110 

7 8 Lys Arg Leu Val Cys Pro Asp Asn Glu Arg Arg His Leu Arg His Met 
79 • 115 120 125 

8 0 Glu Ala Leu Trp Gly Met Thr Asp Val Glu Leu Thr Pro Arg Val Val 

81 130 135 140 

82 Lys Ala His Leu Ala Gly Leu Leu Lys Pro Glu Gly Pro Leu Ser Ala 

83 145 150 155 160 

84 Ala Thr Val Asn Lys Val Arg Ser Thr Gly Lys Arg lie lie Lys Ala 

85 165 170 175 

86 Ala Gin lie Asn Gly Glu Trp Gly Pro Val Asn Pro Phe Gly Val Leu 

87 180 185 190 

88 Asp Arg Glu Lys Glu Ala Lys Ala Glu Arg Leu Thr Leu Thr Ala Ala 

89 195 200 205 

90 Glu Cys Arg Ala Val Leu Pro His Phe Arg Ala Asp Arg Arg Arg Glu 

91 210 215 220 

92 Phe Leu Phe Gin Val Phe Leu Gly Pro Arg Pro Gly Glu Glu Lys Ala 

93 225 230 235 240 

94 Leu Leu Lys Glu Asp Val Asp Val Glu Ala Arg Thr Val lie Phe Arg 

95 245 250 255 

96 Arg Ser Asn Gly Arg Asp Thr Thr Lys Thr Gly Arg Glu Arg Arg Val 

97 260 265 270 

98 Pro Val Pro Asp Glu Leu Trp Pro Val Leu Leu Asp Ala Met Gin Ala 

99 275 280 285 

100 Ser Pro Ser Asp Leu Val Phe Pro Asn Ala Lys Gly Glu Arg Gin Arg 

101 290 295 300 

102 Ala Asp Thr Lys Met Thr Arg Val Leu Arg Thr Ala Leu Ser Ala Ala 

103 305 310 315 320 

104 Gly Val Val Val Gly Trp Asp Tyr He Cys Arg Thr Gin Gly Cys Gly 

105 325 330 335 . 
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106 


Tyr Arg 


Asp Val 


Gin 


Ser 


Gly 


Gly 


Ala Arg Gin 


Glu 


Arg 


Arg 


Cys 


Pro 


107 






340 










345 








350 






108 


Ala 


Cys 


Asp Lys 


Arg 


Met 


Trp 


Ala 


Ser 


Gly Arg 


Pro 


Lys 


Pro 


Ala 


Val 


109 






355 








360 








365 








110 


Trp 


Tyr 


Gly Leu 


Arg 


His 


Thr 


Ala 


Ala 


Thr Leu 


His 


Arg 


Lys 


Ala 


Gly 


111 




370 








375 








380 










112 


Cys 


Asp 


Pro Leu 


Val 


He 


Lys 


Leu 


Val 


Leu Gly 


His 


Ala 


Ala 


Val 


Asp 


113 


3 85 








390 








395 










400 


114 


Thr 


Thr 


Asp Asp 


Val 


Tyr 


Thr 


His 


Leu 


Asp Glu 


Asp 


Tyr 


Cys 


Arg 


Ala 


115 








405 










410- 








415 




116 


Glu 


Leu 


Asn Lys 


Leu 


Ser 


Leu 


Lys 


Ala 


Pro Pro 


Pro 


Pro 


Pro 


Thr 


His 


117 






420 










425 








430 






118 


Gin Gly 


Gly Ser 


Asp Gly 


Gly 


Pro 


Asp 


Ser Gly 


Arg 


Asn 


Thr 


Tyr 


Gly 


119 






435 








440 








445 








120 


Glu 


Gly 


Gly Thr 


Met 


His 


Gly 


Leu 


Gly Asp Leu 


Gin 


His 


His 


Arg 


Ala 


121 




450 








455 








460 










122 


Arg Ala 


Trp Glu 


Ala Arg 


Ala 


Leu 


Pro 


Thr Glu 


Leu 


Pro 


Pro 


Arg 


Asn 


123 


465 








470 








475 










480 


124 


Leu 


Ala 


Gly Gly 


lie 


Pro 


Ala 


Pro 


Leu 


Leu Ser 


Val 


Lys 


Asp 


Val 


Ala 


125 








485 










490 








495 




126 


Ala 


Ser 


Leu Ser 


Val 


Ser 


Thr 


Ala 


Lys 


Val Tyr 


Gin 


Leu 


Leu 


Ala 


Ala 


127 






500 










505 








510 






128 


Gly Val 


Leu Pro 


Thr 


Val 


Trp 


Val 


Gly Gin Ser 


Arg 


Arg 


Val 


Lys 


Arg 


129 






515 








520 








525 








130 


Glu Asp 


Leu Asp 


Ala 


Tyr 


He 


Ala 


Arg 


Ala Thr 


Ala 


Thr 


Gly 


Gly 


Lys 


131 




530 








535 








540 










132 


Arg 


Gly 


Gly Lys 

























133 545 

136 <210> SEQ ID NO: 3 

137 <211> LENGTH: 360 
13 8 <212> TYPE: DNA 

13 9 <213> ORGANISM: Bacteriophage MX 9 

141 <400> SEQUENCE: 3 

142 gtgagctgac ctcaacggtt tgttgggtgg ggagcgggac agcggaccac atggtgccag 60 

143 ggcttacggc ttcgcacacg gggctgggcg atgctgaacg gagcgtccca tgtccacgcg 120 

144 atgccgcctg gcttgcacat agggattcga aacctcgacc ccgagcttgg gaagctcgtg 180 

145 ctctaccaac- tgagctacca ccgcaggcga agcagggcgc aaagtacggg ccgccctgtg 240 

146 gcttgtcaac gggaagtgag gtgctactcc gtctcctcga cggtgagctg gtacgagtcc 3 00 

147 tggaagttgg actcgcggtt gcgcgcgtcc cggacctcga agaggtagac gcctggctcg 360 

150 <210> SEQ ID NO: 4 

151 <211> LENGTH: 360 

152 <212> TYPE: DNA 

153 <213> ORGANISM: Bacteriophage MX 9 

155 <400> SEQUENCE: 4 

156 cgagccgggg acgggagcgg cgggaccggc ttcgcgccgt ttacagcatc cttgctgcaa 60 
15 7 gacgccccga ggcccgaaaa gacgaaggcc ggcagtcccg agtttcctca aggactaccg 12 0 

158 gccttcatgg gtgagcggcg gaagggattc gaaccctcga ccccgagctt gggaagctcg 180 

159 tgctctacca actgagctac caccgcaggc gaagcagggc gcaaagtacg ggccgccctg 240 

160 tggcttgtca acgggaagtg aggtgctact ccgtctcctc gacggtgagc tggtacgagt 300 
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161 cctggaagtt ggactcgcgg ttgcgcgcgt cccggacctc gaagaggtag acgcctggct 360 

164 <210> SEQ ID NO: 5 

165 <211> LENGTH: 42 
16 6 <212> TYPE: DNA 

167 <213> ORGANISM: Bacteriophage MX 9 

169 <400> SEQUENCE : 5 

170 gagcttggga agctcgtgct ctaccaactg agctaccacc gc 42 

172 <210> SEQ ID NO: 6 

173 <211> LENGTH: 240 

174 <212> TYPE: DNA 

175 <213> ORGANISM: Bacteriophage MX 9 
177<400> SEQUENCE: 6 

178 tgccagggct tacggcttcg cacacggggc tgggcgatgc tgaacggagc gtcccatgtc 60 

179 cacgcgatgc cgcctggctt gcacataggg attcgaaacc tcgaccccga gcttgggaag 120 

180 ctcggcctcg acccgtccag gcgttatcag ccgttcgcaa acccttactt cgccttgggg 180 

181 attccgggcc gggggcctgt ccatccgtcg cagcgggtag cagggagtct caggggggtt 240 

184 <210> SEQ ID NO: 7 

185 <211> LENGTH: 257 

186 <212> TYPE: DNA 

187 <213> ORGANISM: Bacteriophage MX 9 

189 <400> SEQUENCE: 7 

190 cgccaccacc tactcaccag ggaggaagtg acggcggccc tgactcagga cgcaacacct 60 

191 acggtgaagg aggcaccatg cacggattgg gagatttgca gcatcaccgg gcgagagctt 120 

192 gggaagctcg tgctctacca actgagctac caccgcggaa cttggccggg ggtataccgg 180 

193 cgccgctgct gagcgtcaag gacgttgcgg cttcactctc agtgagcacg gcgaaggtgt 240 

194 accagctcct cgccgcc 257 

196 <210> SEQ ID NO: 8 

197 <211> LENGTH: 20 

198 <212> TYPE: DNA 

199 <213> ORGANISM: Artificial Sequence 

201 <220> FEATURE: 

202 <223> OTHER INFORMATION: Synthetic Construct 
2 04 <400> SEQUENCE: 8 

2 05 gaaggaggca ccatgcacgg 20 
207 <210> SEQ ID NO: 9 
208-<211> LENGTH: 20 

209 <212> TYPE: DNA 

210 <213> ORGANISM: Artificial Sequence 

212 <220> FEATURE: 

213 <223> OTHER INFORMATION: Synthetic Construct 
215 <400> SEQUENCE: 9 



218 <210> SEQ ID NO: 10 

219 <211> LENGTH: 20 

220 <212> TYPE: DNA 

221 <213> ORGANISM: Artificial Sequence 

223 <220> FEATURE: 

224 <223> OTHER INFORMATION: Synthetic Construct 
226 <400> SEQUENCE: 10 



216 



ctcactgaga gtgaagccgc 



20 
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227 


cgaggtccgg 9 ac 9 c 9 c 9 ca 


20 


229 


<210> SEQ ID NO: 11 




230 


<211> LENGTH: 19 




231 


<212> TYPE: DNA 




232 


<213> ORGANISM: Artificial Sequence 




234 


<22 0> FEATURE: 




235 


<223>. OTHER INFORMATION: Synthetic Construct 




237 


<400> SEQUENCE: 11 




238 


tgccagggct tacggcttc 


19 


240 


<210> SEQ ID NO: 12 




241 


<211> LENGTH: 74 




242 


<212> TYPE: DNA 


■ 


243 


<213> ORGANISM: Myxococcus xanthus 




245 


<400> SEQUENCE: 12 




246 


gcggugguag cucaguuggu agagcacgag cuucccaagc ucggggucga 


ggguucgaau 60 


247 


cccuuccgcc gcuc 


74 


249 


<210> SEQ ID NO: 13 




250 


<211> LENGTH: 20 




251 


<212> TYPE: DNA 




252 


<213> ORGANISM: Artificial Sequence 




2 54 


<22 0> FEATURE: 




255 


<223> OTHER INFORMATION: Synthetic Construct 




257 


<400> SEQUENCE: 13 




258 


tatcccagca* accgccggag 


20 


260 


<210> SEQ ID NO: 14 




261 


<211> LENGTH: 18 




262 


<212> TYPE: DNA 




263 


<213> ORGANISM: Artificial Sequence 




265 


<22 0> FEATURE: 




266 


<223> OTHER INFORMATION: Synthetic Construct 




268 


<400> SEQUENCE: 14 




269 


cagcacgggt gcagcaac 


18 


271 


<210> SEQ ID NO: 15 




272 


<211> LENGTH: 28 




273 


<212> TYPE: DNA 




274 


<213> ORGANISM: Artificial Sequence 




276 


<22 0> FEATURE: 




277 


<223> OTHER INFORMATION: Synthetic Construct 




279 


<400> SEQUENCE: 15 




280 


cccaattggc tcagggcagc ggctcatt 


28 


282 


<210> SEQ ID NO: 16 




283 


<211> LENGTH: 31 




284 


<212> TYPE: DNA 




285 


<213> ORGANISM: Artificial Sequence 




287 


<220> FEATURE: 




288 


<223> OTHER INFORMATION: Synthetic Construct 




290 


<400> SEQUENCE: 16 




291 


ccccatggcg ctcaggggtg cgtcggacgc c 


31 


293 


<210> SEQ ID NO: 17 
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